NSF PAR Search | NSF Public Access Repository


Machine learning materials properties with accurate predictions, uncertainty estimates, domain guidance, and persistent online accessibility

https://doi.org/10.1088/2632-2153/ad95db

Jacobs, Ryan; Schultz, Lane E; Scourtas, Aristana; Schmidt, KJ; Price-Skelly, Owen; Engler, Will; Foster, Ian; Blaiszik, Ben; Voyles, Paul M; Morgan, Dane (December 2024, Machine Learning: Science and Technology)

Abstract One compelling vision of the future of materials discovery and design involves the use of machine learning (ML) models to predict materials properties and then rapidly find materials tailored for specific applications. However, realizing this vision requires both providing detailed uncertainty quantification (model prediction errors and domain of applicability) and making models readily usable. At present, it is common practice in the community to assess ML model performance only in terms of prediction accuracy (e.g. mean absolute error), while neglecting detailed uncertainty quantification and robust model accessibility and usability. Here, we demonstrate a practical method for realizing both uncertainty and accessibility features with a large set of models. We develop random forest ML models for 33 materials properties spanning an array of data sources (computational and experimental) and property types (electrical, mechanical, thermodynamic, etc). All models have calibrated ensemble error bars to quantify prediction uncertainty and domain of applicability guidance enabled by kernel-density-estimate-based feature distance measures. All data and models are publicly hosted on the Garden-AI infrastructure, which provides an easy-to-use, persistent interface for model dissemination that permits models to be invoked with only a few lines of Python code. We demonstrate the power of this approach by using our models to conduct a fully ML-based materials discovery exercise to search for new stable, highly active perovskite oxide catalyst materials.
Benchmark Tests of Atom Segmentation Deep Learning Models with a Consistent Dataset

https://doi.org/10.1093/micmic/ozac043

Wei, Jingrui; Blaiszik, Ben; Scourtas, Aristana; Morgan, Dane; Voyles, Paul M (December 2022, Microscopy and Microanalysis)

Abstract The information content of atomic-resolution scanning transmission electron microscopy (STEM) images can often be reduced to a handful of parameters describing each atomic column, chief among which is the column position. Neural networks (NNs) are high performance, computationally efficient methods to automatically locate atomic columns in images, which has led to a profusion of NN models and associated training datasets. We have developed a benchmark dataset of simulated and experimental STEM images and used it to evaluate the performance of two sets of recent NN models for atom location in STEM images. Both models exhibit high performance for images of varying quality from several different crystal lattices. However, there are important differences in performance as a function of image quality, and both models perform poorly for images outside the training data, such as interfaces with large difference in background intensity. Both the benchmark dataset and the models are available using the Foundry service for dissemination, discovery, and reuse of machine learning models.
Foundry-ML - Software and Services to Simplify Access to Machine Learning Datasets in Materials Science

https://doi.org/10.21105/joss.05467

Schmidt, KJ; Scourtas, Aristana; Ward, Logan; Wangen, Steve; Schwarting, Marcus; Darling, Isaac; Truelove, Ethan; Ambadkar, Aadit; Bose, Ribhav; Katok, Zoa; et al (January 2024, Journal of Open Source Software)

Infrastructure for Analysis of Large Microscopy and Microanalysis Data Sets

https://doi.org/10.1017/s1431927622011539

Wei, Jingrui; Francis, Carter; Morgan, Dane; Schmidt, KJ; Scourtas, Aristana; Foster, Ian; Blaiszik, Ben; Voyles, Paul M (August 2022, Microscopy and Microanalysis)

FAIR principles for AI models with a practical application for accelerated high energy diffraction microscopy

https://doi.org/10.1038/s41597-022-01712-9

Ravi, Nikil; Chaturvedi, Pranshu; Huerta, E. A.; Liu, Zhengchun; Chard, Ryan; Scourtas, Aristana; Schmidt, K. J.; Chard, Kyle; Blaiszik, Ben; Foster, Ian (November 2022, Scientific Data)

Abstract A concise and measurable set of FAIR (Findable, Accessible, Interoperable and Reusable) principles for scientific data is transforming the state-of-practice for data management and stewardship, supporting and enabling discovery and innovation. Learning from this initiative, and acknowledging the impact of artificial intelligence (AI) in the practice of science and engineering, we introduce a set of practical, concise, and measurable FAIR principles for AI models. We showcase how to create and share FAIR data and AI models within a unified computational framework combining the following elements: the Advanced Photon Source at Argonne National Laboratory, the Materials Data Facility, the Data and Learning Hub for Science, and funcX, and the Argonne Leadership Computing Facility (ALCF), in particular the ThetaGPU supercomputer and the SambaNova DataScale® system at the ALCF AI Testbed. We describe how this domain-agnostic computational framework may be harnessed to enable autonomous AI-driven discovery.
14 examples of how LLMs can transform materials science and chemistry: a reflection on a large language model hackathon

https://doi.org/10.1039/d3dd00113j

Jablonka, Kevin Maik; Ai, Qianxiang; Al-Feghali, Alexander; Badhwar, Shruti; Bocarsly, Joshua D.; Bran, Andres M.; Bringuier, Stefan; Brinson, L. Catherine; Choudhary, Kamal; Circi, Defne; et al (August 2023, Digital Discovery)

Large-language models (LLMs) such as GPT-4 caught the interest of many scientists. Recent studies suggested that these models could be useful in chemistry and materials science. To explore these possibilities, we organized a hackathon. This article chronicles the projects built as part of this hackathon. Participants employed LLMs for various applications, including predicting properties of molecules and materials, designing novel interfaces for tools, extracting knowledge from unstructured data, and developing new educational applications. The diverse topics and the fact that working prototypes could be generated in less than two days highlight that LLMs will profoundly impact the future of our fields. The rich collection of ideas and projects also indicates that the applications of LLMs are not limited to materials science and chemistry but offer potential benefits to a wide range of scientific disciplines.
Graph network based deep learning of bandgaps

https://doi.org/10.1063/5.0066009

Li, Xiang-Guo; Blaiszik, Ben; Schwarting, Marcus Emory; Jacobs, Ryan; Scourtas, Aristana; Schmidt, K. J.; Voyles, Paul M.; Morgan, Dane (October 2021, The Journal of Chemical Physics)
